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Abstract. Given a group G, the model Q(G,p) denotes the probabihty space of all Cayley 
graphs of G where each element of the generating set is chosen independently at random with 
probability p. 

Given a family of groups {Gk) and a c £ R+ we say that c is the threshold for diameter 
2 for (Gk) if for any e > with high probability T £ Q{Gk,p) has diameter greater than 2 if 

p ^ yj(c — e) and diameter at most 2 if p y^(c + e)^^^. In [ ] we proved that if c is a 
threshold for diameter 2 for a family of groups (Gk) then c G [1/4, 2] and provided two families 
of groups with thresholds 1/4 and 2 respectively. 

In this paper we study the question of whether every c G [1/4, 2] is the threshold for diameter 
^ 2 for some family of groups. Rather surprisingly it turns out that the answer to this question 

is negative. We show that every c £ [1/4,4/3] is a threshold but a c £ (4/3, 2] is a threshold if 
and only if it is of the form 4n/(3n — 1) for some positive integer n. 



1. Introduction 



o 
u 

^ Let us begin by recalling that given a group G and a subset 5 of G, the Cayley graph 

r = T{G;S) of G with respect to S has the elements of G as its vertex set and has an edge 
^ between g and h if and only if hg~^ G 5 or gh~^ G 5. We ignore any loops or multiple edges. 

I— ' In particular, whether 1 S S" or not is immaterial. Observe for example that F is connected if 

I and only if the set S generates the group G. Throughout the paper, we will often refer to the 

J> set S as the generating set of the graph T irrespectively of whether it is a generating set for the 

group G or not. 

The model G{G,p) is the probability space of all graphs T{G; S) in which every element of 
G is assigned to the set S independently at random with probability p. We refer the reader 
to [ ] for a discussion of some similarities and differences between this model and various other 
models of random graphs. 

CN In this paper we continue the study of the diameter of random Cayley graphs that was 

^ initiated in [o]. Given a family of groups (Gk) of orders Uk with — ?■ oo as A; — )■ oo and 

¥^ a c G M+ we say that c is the threshold for diameter 2 for (Gk) if for any e > with high 

>< probability T e G{G,,p) has diameter greater than 2 if p ^ Ji^^^ and diameter at most 
H I ^ 

^ 2 if p ^ W (c + In [ ] we proved that if c is a threshold for diameter 2 for a family of 

groups {Gk) then c G [1/4,2]. Moreover this is best possible as 1/4 is the threshold for the 
family of symmetric groups Sn while 2 is the threshold for the family of products of two-cycles 

{C2T- 

Given the above result it is natural to ask whether for every c G [1/4,2] there is a family 
of groups for which it is a threshold for diameter 2. Rather surprisingly, it turns out that the 
answer is negative. The aim of this paper is to give a characterisation of the values of c that 
can appear as thresholds. 
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Theorem 1.1. Let c £ [1/4,2]. Then c is the threshold for diameter 2 for a family of groups 
(Gk) if and only if c G [1/4, 4/3] U : n G N}. 

We break the proof of Theorem 1.1 into the following results. 

Theorem 1.2. Letc G [1/4,1/2). Then c is the threshold for diameter 2 for the family SnX Cm, 

2c 

where m = [nl i-2c J . 

Theorem 1.3. Let c £ [1/2,1). Then c is the threshold for diameter 2 for the family G = 

(l-c)n 

{G2T X (C™) where m = [2^^J. 

Theorem 1.4. Let c G [1,4/3). Then c is the threshold for diameter 2 for the family G = 

(4-3e)n 

(C2)" X {D2m) where m = [2 as J and is the dihedral group of order 2m. 

Theorem 1.5. Suppose c > 4/3 is the threshold for diameter 2 for some family of groups. Then 
CG{ 

3n-i '■ n gN}. Moreover, given a positive integer n, g^zrf is the threshold for diameter 2 
for the family {{C2)'' x D4n)k- 

It is evident that Theorem 1.1 follows from Theorems 1.2-1.5. These results will be con- 
sequences of the more general Theorem 2.1 which enables us to determine the threshold for 
diameter 2 for a family of groups provided we have enough information about some specific 
structure of the groups of this family. 

As we mentioned this looks like a surprising result as one would expect that by taking 
'suitable combinations' of groups with different thresholds, one should be able to interpolate to 
get all possible values between the two thresholds. This is exactly what happens in Theorem 1.2. 
Symmetric groups have threshold 1/4, cyclic groups have threshold 1/2, and by taking a suitable 
direct product we can get any threshold in the interval [1/4, 1/2]. However there is a natural 
reason why this result is not so surprising after all. As one would expect the result depends on 
various group properties and there are many such properties for which you cannot interpolate. 
We are thinking here of properties which say that when the group is 'too close to being abelian' 
then it actually is abelian. For example it is a trivial observation that if more than half of the 
elements of a group are central (i.e. they commute with every other element) then the group is 
abelian and so all elements are central. (In fact the same holds even when more than a quarter 
of the elements are central.) In a similar manner, if more than 5/8 of pairs of elements commute 
then all pairs commute [(i, ] and if less than 7|G|/4 pairs of elements are conjugate then no pair 
of distinct elements are conjugate ['}]. In our situation the important group theoretic property 
which affects the final outcome is the percentage of involutions (i.e. elements which square to 
the identity) of a finite group. It turns out that in the interval [1/2, 1] there are only finitely 
many values which can appear as proportion of involutions [ ] and this is exactly what causes 
the set of thresholds in the interval (4/3, 2] to be discrete. 

The plan of the paper is as follows. In Section 2 we define certain dependency graphs of a 
group G. Their structure and more specifically the number of their edges will be the key factor 
to decide whether c is the threshold for a family of groups or not. This is formulated precisely 
in Theorem 2.1. The proof of this theorem will naturally split into two cases. The case where p 
is below the threshold in which we show that the diameter of the random Cayley graph is with 
high probability greater than 2, and the case where p is above the threshold in which we show 
that the diameter of the random Cayley graph is with high probability at most 2. We consider 
these cases in Sections 3 and 4 respectively. Finally in Section 5 we apply Theorem 2.1 to prove 
Theorems 1.2-1.5. 

2. The dependency graphs for distance greater than 2 from the identity 

Let G be a finite group and let us denote its identity element by 1. For each non-identity 
element x of a group G, we define the graph whose vertices are the elements of G and where 
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g is adjacent to h if and only if /i G {xg,xg~^,x~^g,x~^g~^^gx,gx~^,g''^x,g~^x~^}. Here we 
do allow loops but we ignore multiple edges. 

The reason for the definition of this graph is the following: If S" is a subset of G then x is 
at distance greater than 2 from the identity in r(G; S) precisely when x,x~^ ^ S and Tx[S] is 
independent. Indeed, x is a neighbour of the identity if and only if a; G S" or G S and x is at 
distance 2 from the identity if and only li x,x~^ ^ S and furthermore there is a y such that 1 is 
a neighbour of y and y is a neighbour of x in V{G; S). The latter happens if and only if at least 
one of y,y~^ and at least one of yx~^,xy~^ belong to S. Setting g to be equal to one of these 
four expressions and setting h to be equal to one of the other two we get that this happens if 
and only if /i G {xg,xg~^ ,x~^g,x~^g~^ ,gx,gx~^ ,g~^x,g~^x~^}. 

We now make a few simple observations about these dependency graphs 

(01) Vx has maximum degree at most 8. 

(02) Every vertex of is incident to an edge which is not a loop. 

(03) Tx has a loop at y if and only if a; = or a: = y~^. 

(04) For every two distinct elements g,h oi G there are at most eight distinct F^^'s in which 
g and h are adjacent. 

Given a graph F with loops but no multiple edges we write L{T) for the set of loops of F and 
E{T) for the set of edges of F which are not loops. We also write i{T) and e(F) for the sizes of 
the corresponding sets. 

For a group G, and an element x of G distinct from the identity we write 

gcix) = \G\ IGI . 

Also, if = 1 then we set 

haix) = e-/^^(^^) 

while if x^ 7^ 1 then we set 

Given a positive real number c we write 

/g(x;c) = gGixyhGix)"^. 

Finally, for a graph G and a positive real number c we write 

fG{c)= ^ fG{x;c). 
xeG\{i} 

As the group G will usually be understood from the context, we will often ignore the dependence 
of G and write g{x),h(x), f{x;c) and /(c) instead. It turns out that this last quantity is the 
one which determines the threshold for diameter 2 in random Cayley graphs. 

Theorem 2.1. Let c be a positive real number and let {Gk) be a family of groups whose orders 
tend to infinity. Then c is the threshold for diameter 2 for this family if and only if for every 
e > we have that fa^. (c — e) — >■ oo and fo^ (c + e) — >■ as A; — >■ oo. 

This is a direct consequence of the following two results. 

Theorem 2.2. Let c be a positive real number and let (Gk) be a family of groups whose orders 
tend to infinity. Suppose that for every e > we have that fa^ic — e) — )■ oo as k ^ oo. Then for 

every e > 0, with high probability the diameter ofT^Q ^Gk, l^fcl ^ ^ greater than 2. 

Theorem 2.3. Let c be a positive real number and let (Gk) be a family of groups whose orders 
tend to infinity. Suppose that for every e > we have that fGf,(c + £) — )■ as k oo. Then for 

every £ > 0, with high probability the diameter ofVEQ (Ck, \f^^^^^^^^^ > '^^ most 2. 
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We proceed in the next two sections to prove these two theorems. The main ideas of the 
two proofs akeady appear in [ ] and more specificahy in the proofs of Theorems 1.3 and 1.6 
respectively. 

3. Proof of Theorem 2.2 

In the proof we will use the following result known as Kleitman's Inequality. 

Theorem 3.1 (Kleitman's Inequality). Let 0, be a finite set and let {Fi}i^j be subsets of Q, 
where I is a finite index set. Let R be a random subset of 0, and for each i £ I let Ei be the 
event that Fi <^ R. Then 




We refer the reader to [ ] for a proof of the inequality and continue with the proof of Theo- 
rem 2.2. 

Let G = Gk where k is large enough so that all of the following calculations hold. We write 
n for the order of G. 

For any element x of G, let us write for the event that x has distance greater than 2 from 
the identity. Suppose first that 7^ 1. Then we have that 

This follows directly from Kleitman's Inequality since 

Bx = Ex^E^-i Pi (EyCiEy-i) Pi Ee, 

{y:y^=x} eeE(r^) 

where for an element z of G, Ez denotes the event that z appears in the generating set and for 
an edge e of Tx, Ee denotes the event that both of its incident vertices appear in the generating 
set. Thus, provided k and therefore n is large enough, we have 

= -h{x) ^-p g{x) ^-p'^ 

^ h{x)V^^g(xr~^/^ = ^f{x; c - e/2). 

X 

Here we have made use of the inequality (1 — x) ^ e 1-^ which holds for all x G [0, 1). 
If = 1, then for each y with = x we have = x~^ = x as well, so 

P (E^n:^)= p E-y. 

{y:y'^=x} {y-y^=x} 

So in this case we again have 

= -h{x) ^-p g{x) ^~p^ 

^ h{x)V^^g{xy'e/^ = lf{x-c-e/2) 

Let us now write X for the number of vertices of G which are at distance greater than 2 from 
the identity. Then 

EX = ^ Y>v{Bx) ^ ^fcic - e/2) ^ 00. 
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However this is not enough to show that X > with high probabihty. The reason is that 

the some pairs of these events are highly correlated. For this reason, in general we will have 
Var(X) 3> (EX)^ and so the second moment method cannot be used directly. To overcome this 
difficulty, we will give a quantitative result of the fact that if and By are highly correlated 
then the graphs and Ty have many common edges. Having proved this result, the next task 
will be to cleverly choose a subset / of the elements of G such that the events {Bx)xei 
'almost' independent and furthermore it is still the case that Pr(i?2;) — )• oo. 

Lemma 3.2. Let x, y be distinct elements of G such that the graphs and Ty have exactly r 
common edges. Suppose furthermore that either iiT^) = o{y^) or i{Ty) = o{y^). Then 

FriB^nBy) ^ (l + o(l))Pr(A)Pr(5?;)(l 

Proof. We may assume that i{Ty) = Let us write ei, . . . , for the common edges of 

Tx and Fy and let e^+i, . . . , be all other edges of Ty. By (03) at most two of ei, . . . , are 
loops. For each 1 ^ i ^ s, let Q be the event that not both vertices incident to Cj appear in 
the generating set and let Di = n*^^Cj. Since Dg = By then 

p.(B. n B.) = p.(B.) P.(B,)£gyj> = MB,) MB,) n i^fjfly 

where by convention Pr(Bx\Do) := Fi:{Bx). 

By the law of total probability, for each 1 ^ i ^ s we have 

Pr(A n A-i) Pr{Bx n A-1 n d) + Pr( A n A-i n Cl) 



Pr(AlA-i) 



Pr(A-i) Pr(A- 



Pr(A| A-1 n Gi) Pr(A-i n d) ^ Pr(A n A-i n Gi[ 



= Pr(A|A)Pr(Q|A-i) + 



Pr(A-i) Pr(A-i) 

Pr(A n A-1 n Cl) 



Pr(A-i) 



To bound the first term, we use Kleitman's Inequality to the events Q and A-i to get 

Pr(A|A)Pr(Ci|A-i) > Pr(A|A)Pr(a) 

_ j (1 -p)Pr(AlA) if ej is a loop, 
~ I (1 - p2) Pr( A I A) otherwise. 

To bound the second term, observe first that if 1 ^ i ^ r then A ^ Cj and so Pr(A H A-i H 
Ci) = 0. If r + 1 ^ z < s, let Xj denote the event that no vertex adjacent to a vertex of e, 
in appears in the generating set and Yi the event that for each j < i if ej meets ej then 
the vertex incident to ej but not Cj is not in the generating set. Then, A H -Dj-i fl Cj = 
A n Di-i Cl Gi n Xi nYi. Observe now that whether the two vertices incident to the edge Cj 
appear in the generating set or not does not affect the outcome of the event A H A-i H fl 1^, 
i.e. the events A-H A-inXiHYi and Gi are independent. Applying now Kleitman's Inequality 
to the events A U A-ij Xj and Yi we get 

Pr(A n A-1 n a) = Pr(a) Pr(A n A-i nXiD Y) 

^ Pr(a) Pr(X) FviY) Pr(A n A-i) 

fp(l - p)28 Pr(A| A-i) Pr(A-i) if e, is a loop, 
\p^{l - p)'^^FT{Bx\Di^i)PT{Di_i) otherwise. 



To see the last inequality, recall that by (01) Tx and Ty have maximum degree at most 8 and 
so each of the events Xi and Yi says that at most 14 elements of G (at most seven for each 
vertex incident to Cj) do not appear in the generating set. 
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It follows that if 1 ^ ? ^ r then Pr(B^.|A) ^ Pr(S^| - P^) unless Cj is a loop in 
which case we have Y'x{Bx\Di) ^ Pr(i?2.|L'.j_i)/(l — p). Also ifr + l^i^s then 

Pr(i?,|A) / l-p2(l_p)28 ^ ^ ^ ^ 



^ , ^ ^ ' ^ 1 + 29p'' 



Pr(B,|A-i) 

provided that n is large enough unless is a loop in which case we have 

Pr(S^lA) l-pil-pf^ o 
Pr(5,|A-i) 1-p 
provided n is large enough. So recalling that at most two edges with 1 ^ i ^ r are loops and 
at most o([^^) edges are loops in general we get 

Pr(S, n By) ^ Pr(S,) VT{By){l-p)-\l-p')-'{l + 29/)^(^«)(l + 29p3)4- 



:l + o(l))Pr(S,)Pr(BJ,)(l-^)2^ 



as p £{Ty) = o(l) and p'^n = o(l). □ 

The next task is to choose a 'large' set / such that the events Bx for x G I are 'almost 
independent' in the sense of Lemma 3.2. To this end we define a new graph H with vertex set 
the elements of G excluding the identity in which x and y are adjacent if and only if Tx and Ty 
have at least n^~^ common edges. (Later, we will set 6 = e/8.) Since by (01) Tx has at most 
4n edges and by (04) every edge of Fx belongs to at most 7 other P^'s, the degree of x in is 
at most 28n^ . Let us now write xi, . . . , for the vertices of H. We may assume that 

f{xi;c - e/2) ^ • • • ^ /(x„_i; c - e/2). 

Now going through this order, we greedily pick an independent set I oi H and let Y be the 
number of vertices y £ I which are at distance greater than 2 from the identity and furthermore 
they satisfy ^(Pj,) = o(i^^)- Using (03) we see that the number of y's which do not satisfy 
^(Pj^) = o(j^^) is O(logn). For each such y, using (02) we see that Ty contains a matching 
of size at least n/8. Indeed if M is a maximal matching of size m, then by (02) every vertex 
outside M has at least one neighbour inside M and so there are at least n — m edges with exactly 
one incident vertex inside M. On the other hand by (01) every vertex inside M has at most 
seven neighbours outside M so there are at most 7m edges with exactly one incident vertex 
inside M. So n — m ^ 7m which gives the claimed bound. Therefore, P{By) ^ (1 — p^)^/^. In 
particular the sum over all such y is o(l). 

So since the maximum degree of H is at most 28n^ this way of picking the elements of / 
guarantees that 

1 

^ A dcic - e/2) - oil) ^ /g(c - e/2) - oil) 

2(28n^ + 1)-'^^ ' ' ^ ^ 60 ■'^ ' ' ^ ' 

But for each x we have 

n-'g{x)'-^'^ = n-'5-(^--/2)^ ^ „-(c+25-./2)i^ 
since by (02) we have eiVx) ^ n/2. Thus, by taking 5 = e/8 we obtain 

n-'g{x)'-'/^ ^ g{xr-'-l^ 



and so 

Therefore we get that 



n-V(x,c-e/2) ^ /(:E,c-e/4). 



^/G(c-e/4)^oo. 
bO 

Since also Ey(y — 1) ^ (1 + o(l))(Ey)^ by Chebyshev's inequality we get 

, Var(y) Ey(y-i) i 
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from which we deduce that with high probabihty there are vertices of T which are at distance 
greater than 2 from the identity, thus completing the proof of Theorem 2.2. 



Let G be a group from the family (G^) and let us write 1 for its identity element and n for 
its order. Since the graph T is vertex-transitive it is enough to show that with high probability 
every element of G is at distance at most 2 from the identity. Now fix a non-identity element x 
of G and for each y 7^ let us denote by Ay := Ay{l,x) the event that the edges between 1 
and y and between x and y both appear in F. So we have 



The events Ay are not independent and so to estimate the right hand side of the above inequality 
we will make use of the following inequality of Janson. We refer the reader to [ ] for its proof. 

Theorem 4.1 (Janson's Inequality). Let 0, be a finite set and let {Fi}iQj be subsets ofQ., where 
I is a finite index set. Let R be a random subset of and for each i ^ I let be the event that 
Fi C R. Suppose also that Pr{Ei) ^ e for each i & L. Then 



We will not apply this result directly to all of the events Ay but only to a specially chosen 
subset of them. 

We start by defining an equivalence relation on G by letting y be equivalent to z if and only 
if y = z or y = z"^ . We write [y] for the equivalence class of y and we let Q be the set of all 
equivalence classes. We let Rhe a random subset of O where each equivalence class is chosen 
independently with probability p or 2p—p'^ depending on whether the equivalence class contains 
one or two elements. For each element i of G we let Fi = Fi{x) = {[i], and we let Ei be 

the event that Fi C R. For any / C V{G) \ {1, a;} we have 



Now let / = J U /q U Ji U /2 be any subset of V{G) \ {1, x} with the following properties. 
(PI) For each i £ J we have |-Fj| = 1. 

(P2) For each i £ Lq U Li L) I2 we have |-Fi| = 2. Furthermore exactly t of the equivalence 

classes in Fi have size 2 if and only if i G If. 
(P3) For each j G J and i G /q U Ii U I2 we have that Fi n Fj = 0. 
(P4) For each distinct i,j £ Lq U Li U I2 we have Fi ^ Fj. 

Here the sets /, J, Lq, F, I2 depend on x and later we will have such sets for each x. In order 
to distinguish between them, we will sometimes write /(x), J (x) , Io{x) , Ii{x) and L2{x) instead. 
However if we feel that there is no danger of confusion we will choose to drop the dependence 
on X. 

Suppose now that we are given such sets satisfying properties (P1)-(P4). Observe that for 
each i £ J we have = x, so i 7^ i~^ and therefore Pr(£^j) = 2p — p^. Observe also that for 
each i £ Iq we have Pr(£'j) = p^, for each i £ Ii we have Pr(£'j) = p(2p — p^) and for each 
i £ L2 we have Pr(i?j) = (2p — p^)^. In particular for each t £ {0, 1,2} and each i £ It we 
have Pr(ii^j) = 2*p^ + 0{p^). Finally observe that for each i there are at most 6 other j's for 
which Fi n Fj 7^ 0. Indeed Fi Ci Fj ^ ^ if and only if [j] = [i] or [j] = [xi~^] or [xj~^] = [i] or 
[xj~^] = [xi^^] which (for j 7^ i) happens if and only if j £ {i^^ , xi^^ , ix~^ , ix, ix~^ , xi~^ x} . 



4. Proof of Theorem 2.3 
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So applying Janson's inequality with this set / we get 

Pr((i(l,x) > 2) ^ exp(-2p|J| + p^\J\ - p^\h\ - 2p^\h\ - Ap^\l2\ + 0{np^)) . 

We continue by choosing the sets J{x) that we are going to use. To this end let K{x) = {i G 
G : = x} and observe that for (PI) to hold we must have J{x) C K{x). li x"^ ^ 1 we set 
J(x) = K{x), while if = 1 then we choose J{x) such that it contains exactly one element from 
each pair contained in K. Note that all these pairs really have size two since if it was 

the case that i = then we would have i'^ = 1 ^ x and so i ^ K{x). So with this definition, 
in the first case we have |J(x)| = ^(r^:) while in the second case we have | = i(Tx)/2. In 
particular, in both cases we get 

PT{d{l,x) > 2) ^ (l + o(l))/i(^)v^^-{c+£)(Uo|+2|/i|+4|/2|-|J|)/n_ 

Suppose now that for each x we could find Io{x), Ii{x), l2{x) such that properties (P2)-(P4) are 
satisfied and furthermore we have that 

|Io(x)| + 2|/i(x)| + 4|/2(x)| - I J(x)| ^ (1 + o(l))e(r,). (1) 

Then we would get 

Pr(d(l, x) > 2) ^ (1 + o(l))/(x; c + e/2) 

and so the probability that there is an x with distance greater than 2 from the identity would 
be at most 

^ Pr(d(l, x) > 2) ^ (1 + o(l))/(c + e/2) ^ 

X 

and therefore by the first moment method with high probability the diameter of T would be at 
most 2. 

In fact, because for each x for which | J(x)| ^ n?^^ we have that 

Pr(d(l, x) > 2) ^ (1 + o(l)) exp{-(2p - p'^)\J{x)\} ^ (1 + o(l))e-Pl^(^)l = o(l/n) 

it is enough to find sets /o(x), /i(x), 12(2;) satisfying properties (P2)-(P4) and (1) only for those 
X for which |J(x)| < n^/^. Furthermore in this case we don't even need to satisfy property 
(P3). Indeed as we observed earlier, for each j E Jix) there are at most six other i's with 
Fi n Fj ^ 0. So having found sets Io{x), Ii{x), l2{x) satisfying properties (P2),(P4) and (1) by 
removing at most 6|J| = o{n) elements from each of them we will get sets Iq{x) , I[{x) , H^ix) 
satisfying (P2)-(P4) and since e{Tx) ^ n/l, the inequality (1) will still be satisfied. 

So pick an element x of G which is different from the identity and assume that | J(x)| < n'^^^. 
Let us pick an element y oi G distinct from the identity for which y^ ^ x and 7^ x. 
The condition |J(x)| < n^/^ guarantees that all but o(n) elements y satisfy these properties. 
Our first task is to understand how many neighbours does y have in the graph r^;. Recall that 
A^(y) = {xy, xy~^, x~^y, x~^y~^ ,yx, y~^x, yx~^ ,y~^x~^}. So we want to find the size of this set 
and this of course depends on certain relations that x and/or y may satisfy. All the information 
required is conveyed in Table 1. 



Rl 


x^ = 1 


1,3 


2,4 


5,7 


6,8 


B,E 


B = E 


R2 


y' = i 


1,2 


3,4 


5,6 


7,8 


* 


B = C 


R3 


xy = yx 


1,5 


2,6 


3,7 


4,8 


C,D 


C = D 


R4 


{xyY = x^ 


1,6 


2,5 


3,8 


4,7 


B,D 


B = D 


R5 


[xyY = y^ 


1,7 


2,8 


3,5 


4,6 


A,F 


A = F 


R6 


{xyY = 1 


1,8 


4,5 










R7 


{xy-'Y = 1 


2,7 


3,6 






* 


D = E 


R8 


x^ = 


2,3 


6,7 






C,E 


C = E 


R9 


x^ = y~'^ 


1,4 


5,8 











Table 1. 
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We can use Table 1 to interpret in which instances the possible neighbours of y are actually 
equal. Here we list the possible neighbours in the order xy, xy~^, x~^y, x~^y^^, yx, y~^x, 
yx~^ , y~^x~^ . For example, the first row of the table says that whenever x^ = 1 then the first 
expression xy is equal to the third expression x~^y, the second is equal to the fourth and so on. 
The last two columns of Table 1 will be explained later. We can now use Table 1 to construct 
Table 2. 



1 

1 




123456 


** 


n TP 


o 
I 


T? 1 "DO 

Ki,K2 


1 OQ/1 1 C^f^VQ 

izo4|ob (0 


* 


nr^ TP 

dLj hi 


6 


TD 1 "DO 

Ki,Ko 


io5 / |z46o 




A TP\ T:> TP \ 

Ar \r>rj\LyJJ 


A 

4 


Ki,K4 


1 Q^IC 1 O/l 
iODo|Z40 ( 


* 


t? n z? 







1 Q 1 O/l 1 1 

io|z4|ob| (o 




Id hi 





TZ> O "D O 


iz56|o4(^5 


* 




1 
i 


T? O "D 

KZ,KO 


1 OVC 1 Q /I C^f^ 
iZ r o|o40D 


** 




Q 
O 


"D O 

Kz 


1 O 1 Q /I 1 1 

iz|34|56| /8 


* 




f\ 

y 


"DO X) c Tyn 


i458|z36 1 


* 


r^TP 


1 n 
iU 


H? Q "D ^ 


1 /I CO 1 O/^ 1 Q V 

i4oo|ZD|o/ 




Lj L) 


ii 


TZ> Q "D V 

Ko,K/ 


1 1 /) Q 1 OQ^^ V 
iO|4o|ZOD / 


* 


Li L)h 


12 


R3 


15|26|37|48 




CD 


13 


R4,R5 


1467|2358 




AF\BD\CE 


14 


R4 


16|25|38|47 




BD 


15 


R5 


17|28|35|46 




AF 


16 


R6,R7 


18|27|36|45 


* 




17 


R6,R8 


18|23|45|67 




CE 


18 


R6 


18|45|2|3|6|7 






19 


R7,R9 


14|27|36|58 


* 




20 


R7 


1|4|5|8|27|36 


* 




21 


R8,R9 


14|23|58|67 




CE 


22 


R8 


1|4|5|8|23|67 




CE 


23 


R9 


14|58|2|3|6|7 






24 




1|2|3|4|5|6|7|8 







Table 2. 



From Table 2 we can read how many neighbours each y has depending on which relations 
X and y satisfy. For example the third row says that if x and y satisfy the relations in rows 
1 and 3 of Table 1 and all the relations that can be deduced by them but no relation from 
any other row, then y has exactly two neighbours. This is because the first, third, fifth and 
seventh expression in the above list are all equal and also the second, fourth, sixth and eight 
relation are again all equal but distinct from the others (as no other relation holds). The last 
two columns of Table 2 will be discussed later. We now explain how the second of Table 2 is 
constructed. (The construction of the third column from the second is obvious.) In principle. 
Table T2 should have 2^ rows, one for each subset of relations that appear in Table 1. However 
since some of these relations together imply some others we can cut this down a lot. We start 
by setting all relations one by one to be true until they force all other relations to be true. 
For example, once the relations in rows R1-R3 of Table 1 are true, then so are all the other. 
This creates the (second column of the) first row of Table 2. Having constructed the A;-th row 
of Table 2 we construct its (A; + l)-th row as follows: If in the second column of the /c-th row 
we have the elements Rii, ■ ■ ■ ,Ris with ii < ■ ■ ■ < ig, then for the {k + l)-th row we set the 
relations Rii, . . . , Ris-i as true, all other relations Rit with t ^ s as false and then we continue 
by setting all other relations one by one as true until these force the rest to be either true or 
false. 
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Recall that we do the above calculations only for elements y for which x ^ So when 

computing -^(ra;) we will miss all neighbours of such y's. However we already observed that 
there are at most 2| J(a;)| < 2n^/^ = o{n) such y so the total contribution that we miss is only 

Our second task now is for each y with y ^ l,x,x ^ and y^,y ^ ^ x to understand the 
sizes of the equivalence classes of Fy and also how many distinct z's are there for which 
Fy 7^ Fz- We begin by recalling that Fy n F^ is not empty only in the cases where z = 
y^^,yx,y~^x,xy~^,yx~^,xy~^x in which cases we have Fy-i = {[y],[xy]}, Fy^ = {[y\,[yx\], 
Fy^x = {[y],[y'^x]}, F^y-i = {[xyx-^],[xy-^]}, Fy^-i = {[x'^y-'^],[xy-^]} and F^y-i^ = 
{[xy~^x],[xy^^]}. Let us write A = y~^,B = yx,...,F = xy~^x. Now let us go back to 
Table 1. The last column denotes equalities between A, B, . . . , F. For example the first row 
says that whenever x^ = 1 then B = E (i.e. yx = yx~^) and no other relation must necessarily 
hold. The second to last column says for which z we actually have Fy = Fz- For example again 
from the first row we read that if = 1 then we must have Fy = Fb = Fe but no other 
equality must necessarily hold. The stars in the second and seventh row denote the fact that 
some of the equivalence classes of Fy will have size one instead of two. For example if = 1 
then the class \y] of Fy has size one. 

We now discuss the last two columns of Table 2. The second to last column denotes how many 
equivalence classes of Fy have size one. The number of them is the number of stars appearing 
in this column. The last column denotes which F^'s are neighbours to Fy. For example in 
the first row we have that all Fb, Fq, F^, Fe are equal to Fy and moreover we actually have 
B = C = D = E. While in the third row we see that all Fa, ■ ■ ■ , Fp are equal to Fy but in fact 
the only equalities between A,B,...,F are A = F, B = E and C = D. 

Let us now define a graph R{x) on the set of all y with y ^ l,x, x~^ and x ^ y'^,y~'^ in which 
we join y to z if and only if Fy = Fz- We see from Table 2 that this graph is a union of disjoint 
i^2's and K/^s. To satisfy conditions (P2) and (P4) it is necessary and sufficient the the elements 
we pick to form Iq{x) U Ii{x) U l2{x) are an independent set of R{x). We now go through each 
K2 and ^4 and pick one element at random. We calculate E(|/o(a;)| + 2|/i(x)| + 4|l2(a;)|). The 
first row of Table 2 says that each y of this type contributes ^ • 1 = ^ to this expectation. Here 
the 1/2 comes from the fact that this element is chosen with probability 1/2, while the 1 comes 
from the fact that this element will belong to Iq{x). Because every such y has exactly one 
neighbours, it also contributes 1/2 to the count of -E(r^.) = \ NY^{y)\/2. We now go through 
each row and we discover that the contribution to E(|Io(x)| + 2|/i(x)| + 4|/2(3;)|) in each one of 
them is equal to the contribution to -^(r^.) unless in the following instances: 

• In row 10 we get a contribution of 2 instead of 3/2. 

• In row 11 we get a contribution of 1 instead of 3/2. 

• In row 18 we get a contribution of 4 instead of 3. 

• In row 20 we get a contribution of 2 instead of 3. 

• In row 22 we get a contribution of 2 instead of 3. 

• In row 23 we get a contribution of 4 instead of 3. 

However, the number of y's which satisfy the relations of row 10 is equal to the number of y's 
which satisfy the relations of row 11. Indeed one can check that y satisfies the relations of row 
10 if and only if y^^ satisfies the relations of row 11. Similarly, again under the transformation 
y I—)- y~^, we see that the number of y's which satisfy the relations of row 18 is the same as 
those which satisfy the relations of row 20 and the same happens with rows 22 and 23. So we 
have 

E(|/o(x)|+2|/i(x)|+4|/2(x)|) ^ (l + o(l))^(r,) 

where the 1 + o(l) factor comes from the fact that we did not consider y's with x = y^ 01 
X = y~^. In particular there is a choice of y's for which 

\Io{x)\ + 2\h{x)\ + A\h{x)\ ^ (1 + o{l))E{T,). 
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This completes the proof of Theorem 2.3. 

5. Proofs of Theorems 1.2-1.5 

We begin with the proof of Theorem 1.3 which is the simplest of the four. To avoid unneces- 
sary repetitions both in this and the proofs that follow, whenever we refer to the neighbours of 
a vertex in a graph we will mean the neighbours of this vertex in the underlying simple graph. 

(1 — c)n 

Proof of Theorem 1.3. Let G = (6*2)"' x (Cm) where m = [2 ^ J and let us write N for the 
order of G. So 2" ~ N'^ and m ~ N^~'^. We now compute f{x; c') for each x £ G. 

(a) If X is of the form {x' ,1) where 1 is the identity in Cm, then = 1, r^; has no loops, 
and each y has exactly two neighbours in Fx, namely xy and xy~^ unless = 1. 
Since there are at most 2^~^^ = o{N) such y's we have e{rx) = (1 + o{l))N and so 
f{x, c') = N~^'~^°^^\ Since there are 2" ~ N'^ such elements x the total contribution of 
these elements to /(c') is N^~'^ +°W which tends to infinity if c' < c and tends to if 
c' > c. 

(b) If X is not as in (a) but still satisfies = 1, then each y has exactly two neighbours in 
r^^, namely xy and xy~^ unless G {l,x}. Since there are at most o{N) such y's we 
have eiVx) = (1 + o(l))iV and so f{x,c') = Ar-c'+o(i). Since there are at most 2" ~ N" 
such elements x the total contribution of these elements to /(c') is N'^"^ +"W which 
tends to if c' > c. 

(c) If X is not as in (a) or (b) then all but o{N) y's have exactly four neighbours in r^, namely 
xy,xy-^,x-^y,x-'^y-^. Therefore e(Tx) = (2 + o(l))iV and so f{x,c') ^ Ar-2c'+o(i)_ 
particular the total contribution of such elements to f{c') is at most N^^"^^ +°W which 
(since c ^ 1/2) tends to if c' > c. 

Combining the calculations in (a)-(c) we see that /(c') tends to infinity if c' < c and to if 
c' > c as required. □ 

We now proceed with the proof of Theorem 1.2. In the proof we will repeatedly make use of the 
facts that for each element x of Sn there are at most elements y with y'^ = x and (un- 

less X is the identity) at most (n— 2)! elements which commute with x. (The second assertion is a 
simple exercise. To prove the first assertion, it is enough by Exercise 7.69.c] to prove it when 
X is the identity. A proof of this is included in [ ].) In particular for each element x of Sn there 
are at most o(n!) elements y for which the set {xy,xy~^ , x~^y, x~^y~^ ,yx,y~^x,yx^^ ,y^^x~^} 
has size less than 8. The only case where this might not be immediately obvious is that for 
each X there are at most o{n) elements y with (xy)^ = y^. But if there is a z with (xz)^ = 
then z~^xz = x~^ . So x~^ belongs to the equivalence class of x and so the number of y's with 
(xy)^ = is the same as the number of y's for which y~^xy = x, i.e. it is the same as the 
number of elements which commute with x. 

2c 

Proof of Theorem 1.2. Let G = Sn^ Gm where m = [n!i-2cj and let us write for the order 
of G. So n! ~ N^-'^" and m ~ A^2c_ 

We begin by computing /i(x) for each x £ G. Since for each x G Gm there are at most two y's 
in Gm with y"^ = x and for each x £ Sn there are at most n]^/'^~^°W y's in Sn with = x then 
for each element x £ G there are at most A^i/2-c+o(i) gigj^ents y with y^ = x. In particular 
£(r^) = 0(Ari/2-c+o(i)) ^^-^^ 

We continue by computing y(x) and thus /(x; c') for each x £ G. 

(a) There is at most one (non-identity) element x of the form (1, x'), where 1 is the identity 
of Sn, with x^ = 1. It contributes o(l) to f{c'). 

(b) Suppose X is of the form (l,x') where 1 is the identity of 5„ and x"^ 7^ 1. Then all but 
o{N) elements y have exactly four neighbours in Tx, namely xy, xy~^ , x^^y and x~^y~^. 
Therefore e{Tx) = (1 + o(l))A^ and so f{x,c') = A^~2^+°(^). Since there are either m 
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or m — 1 such elements, the total contribution of these elements to f{c') is A^^'^"^'^ 
which tends to infinity if c' < c and tends to if c' > c. 

(c) If X is not of the above forms but = 1, then all but o{N) elements y have exactly 
four neighbours in T^, namely xy, xy^^ ,yx and y~^x. Therefore e{Tx) = (2 + o{l))N 
and so /(x,c') = Ar-2c'+o(i)_ gi^ce there are (at most) n!i/2+o(i) ^ ^i/2-c+o(i) g^^j^ 
elements the total contribution of these elements to /(c') is at most A^i/2-c-2c -^^j^jf^j^ 
(since c ^ 1/4) tends to if c' > c. 

(d) If x is not of the above forms then all but o{N) elements y have exactly eight neighbours 
in r^. Therefore e{Tx) = (4 + o{l))N and so f{x,c^) = A^~^^+°(i). In particular the 
total contribution of these elements to f{c') is at most N'^~^'^ +°W which (since c ^ 1/4) 
tends to if c' > c. 

Combining the calculations in (a)-(d) we see that /(c') tends to infinity if c' < c and to if 
c' > c as required. □ 

(4 — 3c)n 

Proof of Theorem 1.4- Let G = (C2)"' x {D2m) where m = [2 J and -D2m is the dihedral 
group of order 2m and let us write N for the order of G. So 2" = 0(A^^'^/^) and 2m = 
Q(JV^~'^'^/^). We now compute f{x;c') for each x € G. 

(a) Suppose x is of the form (x', 1) where 1 is the identity element of and x' is a non- 
identity element of C2 ■ Since no element of G squares to x then has no loops and so 
h(x) = 1. Note that x is central satisfying = 1 and therefore every y has at most two 
neighbours in Tx, namely xy and xy^^. Moreover it has exactly one neighbour if and 
only if y is an involution. Since has either m + 1 or m + 2 involutions depending on 
the parity of m we have that G has G(A^/2) involutions and so e{Tx) = (3/4 + o{l))N 
and fix;c') = Ar-3/4c'+o(i)^ g-^^^^ ^^leie are 2" = e(iV3^/'^) such elements, the total 
contribution of these elements to f{c') is ]\[^^/^-^^ which tends to infinity if c' < c 
and tends to if c' > c. 

(b) Suppose X is of the form (l,x') where 1 is the identity in (C2)" and x' is a non-identity 
element of I?2m representing a rotation of the regular m-gon. Then the equation y"^ = x 
has at least 2" = 6(A^^^/^) solutions. Since c > 1, we deduce that h{x) = e~^("^''*) and 
so f{x; d) = o{l/N). So the total contribution of these elements to /(c') is o(l). 

(c) For every other element x we have that Vx contains no loops. Moreover, either we 
have that x is not an involution, in which case every y has at least two neighbours in 
r^,, namely xy and x~^y, or x is an involution in which case all but at most 0(2") = 
0(N^/^) elements y commute with x with the rest having at least two neighbours in 
r^^, namely xy and yx. So in both cases we have e{Tx) ^ (1 + o{l))N, and therefore 
f{x,c') = = o{l/N). So the total contribution of these elements to f{c') is also 
o(l). 

Combining the calculations in (a)-(d) we see that /(c') tends to infinity if c' < c and to if 
c' > c as required. □ 

As we mentioned in the introduction the proof of Theorem 1.5 is based on the fact that there 
are finitely many values that can appear as the proportion of involutions of a finite group in 
the interval [1/2, 1]. The earliest reference we could locate for this result is [ ]. Here, we will 
make use of the following theorem of Wall [10] which classifies all groups for which more than 
half of their elements are involutions. 

Theorem 5.1. Let G be a group having exactly (1/2 + c)\G\ involutions, where c > 0. Then 
there is a positive integer n such that c = l/2n. Moreover, G is the direct product of copies of 
G2 together with a group H which has one of the following forms: 

(I) H contains an abelian subgroup Hi of index 2 and an element g with gh = h^^g for 
every h €z H. Moreover G2 does not appear as a factor in the decomposition of Hi as 
products of cyclic groups of prime power orders. 



THE RANGE OF THRESHOLDS FOR DIAMETER 2 IN RANDOM CAYLEY GRAPHS 



13 



(II) H = DsXDs. 

(III) H is generated by involutions c, xi,yi, . . . , Xr,yr for which all of them commute with 
each other apart from the pairs {xi, yi} for 1 ^ i ^ r which satisfy {xiyi)^ = c. 

(IV) H is generated by involutions c,xi,yi, . . . ,Xr,yr for which all of them commute with 
each other apart from the pairs {c, Xj} for 1 ^i ^r which satisfy (cxj)^ = yi. 

In particular, for each d > 0, there is a finite set S{d) of groups such that every group G with 
more than {1/2 + d)\G\ involutions is a direct product of copies 0/C2 together with a group from 
S{d). 

Armed with this result we can move on to the proof of Theorem 1.5. 

Proof of Theorem 1.5. Suppose there is a family (Gfc) of groups for which c > 4/3 is a threshold 
for diameter 2. Let c' = (c + 4/3)/2. By Theorem 2.1 we have that /g^(c') — )> 00. Let G be a 
group from the family {Gk) for which /g(c') is sufficiently large and let us write for its order 
and aN for the number of its involutions. We now compute f{x; c') for each x € G. We may 
assume that the equation x = y^ has at most N^^"^ solutions as otherwise i{Tx) ^ N'^^'^ which 
implies that f{x,c') = o{l/N) and the total contribution of such elements to f{c') is o(l). 

(a) Suppose that x is not an involution. Then each y has at least two neighbours in T^., 
namely xy and x'^y and so e(Tx) ^ N. It follows that f{x;c') ^ = o{\/N) and 
so the total contribution of these elements to /(c') is o(l). 

(b) Suppose that x is an involution which is not central. Let C{x) = {y £ G : yx = xy} 
be the centraliser of x and suppose that it has size /3N. Since C{x) is a subgroup of G 
and since x is not central we must have /3 ^ 1/2. The only y's which have exactly one 
neighbour in must belong to G{x) so 

e(r,) ^ (/3 + 2(1 -f3)+ o(l))n/2 = (1 - /3/2 + o{l))N ^ (3/4 + o{l))N. 

It follows that f{x; c') ^ N-^c'/i+o{i) ^ o{l/N). So again the total contribution of these 
elements to f{c') is o(l). 

(c) If now X is a central involution, then each y which does not satisfy = x has either one 
or two neighbours in T^x depending on whether y is an involution or not. So eij^x^ — 
(1 — a/2 + 0(1)) A^. In particular, if a ^ 1/2 + d, then the total contribution of these 
elements to f{c') is at most A^i~^'(3/4-fi/2+o(i)) -^j^j^j^ q^^i^ if d > is sufficiently small. 

Combining the calculations in (a)-(c) we see that unless the proportion of involutions in G is 
at least 1/2 + d for some sufficiently small but fixed d, then f{c') tends to 0, a contradiction. 

So we can now use Theorem 5.1 to see that G must be a direct product of copies of C2 together 
with a group H from a finite set S{d) of groups. In particular, if M{d) is the largest cardinality 
of a group in S{d) then we have that G contains at least N/M{d) central involutions, namely 
all elements of the form (x, 1) where x belongs to the direct product of the C2's and 1 is the 
identity element of H. In particular, it contains at least G(A^) central involutions x for which 
the equation = x has at least n^^^ solutions. For each such x we have ^(x) = (l + o(l)) and so 
by (c) the total contribution of these elements to f{c') is ©(A^-'^"'^ This tends to infinity 

if c' < and tends to if c' > i^^. So c = 2^ which is equal to 3^"-^ if a = 1/2 + l/2n. 
To complete the proof of the theorem, we just observe that the given family has a proportion 
of 1/2 + l/2n involutions as so does -D4n- D 

6. Conclusion and open problems 

Even though we stated the results with fixed e, the same proofs work to show the following 

Theorem 6.1. Let c be a positive real number and let (Gk) be a family of groups of order 
with tending to infinity. Then c is the threshold for diameter 2 for this family if and only if 
fOkick) 00 whenever c - Ck = ^( ^"w^"'' ) and fcki^k) whenever Ck - c = w(i^^)- 
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As in the case of the diameter of random graphs (see e.g. [■:, Theorem 10.10]) one cannot 
expect to improve the a;( ^^g^^ ) term. Note however that for achieving diameter greater than 2 

we needed an a;( '°f '"^"'^ ) term rather than an coir^ — ) term. It would be interesting to check 
whether this can be improved. 

It would be also interesting to check whether loops in Tx really do make a difference or 
whether the same results hold if /g(c) is replaced by /g(c) = ^^ZxeGXli} doi^Y- 

Finally we mention a related group theoretic problem that arises. As we have already men- 
tioned the problem of which numbers in (1/2,1] can appear as proportions of involutions of 
a finite group has been well studied and the groups with these proportions have been charac- 
terised. But what about numbers in the interval [0,1/2]? Is it true that for every number in 
this interval there are finite groups whose proportion of involutions is sufficiently close to this 
number, or are other gaps in this interval as well? We haven't been able to locate this problem 
in the literature but we believe it is an interesting one to investigate. 
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